AITopics | cumulative utility

How should a player who repeatedly plays a game against a no-regret learner strategize to maximize his utility? We study this question and show that under some mild assumptions, the player can always guarantee himself a utility of at least what he would get in a Stackelberg equilibrium of the game. When the no-regret learner has only two actions, we show that the player cannot get any higher utility than the Stackelberg equilibrium utility. But when the no-regret learner has more than two actions and plays a mean-based no-regret strategy, we show that the player can get strictly higher than the Stackelberg equilibrium utility. We provide a characterization of the optimal game-play for the player against a mean-based no-regret learner as a solution to a control problem. When the no-regret learner's strategy also guarantees him a no-swap regret, we show that the player cannot get anything higher than a Stackelberg equilibrium utility.

artificial intelligence, learner, machine learning, (19 more...)

arXiv.org Artificial Intelligence

1909.13861

Country: North America (0.28)

Genre: Research Report (0.64)

Industry: Leisure & Entertainment > Games (0.88)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.47)

Add feedback

66820ab16b817d8a6b00d60b3d24b83a-Paper-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 19:59:39 GMT

artificial intelligence, fictitious play, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Europe > Austria > Vienna (0.14)
North America > United States > California > Orange County > Irvine (0.04)
North America > United States > Maryland > Baltimore (0.04)
(3 more...)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback

Strategizing against No-regret Learners

Yuan Deng, Jon Schneider, Balasubramanian Sivan

Neural Information Processing SystemsOct-3-2025, 04:32:05 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, learner, machine learning, (20 more...)

Neural Information Processing Systems

Country: North America (0.28)

Industry: Leisure & Entertainment > Games (0.68)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.47)

Add feedback

NoveltyBench: Evaluating Language Models for Humanlike Diversity

Zhang, Yiming, Diddee, Harshita, Holm, Susan, Liu, Hanchen, Liu, Xinyue, Samuel, Vinay, Wang, Barry, Ippolito, Daphne

arXiv.org Artificial IntelligenceAug-12-2025

Language models have demonstrated remarkable capabilities on standard benchmarks, yet they struggle increasingly from mode collapse, the inability to generate diverse and novel outputs. Our work introduces NoveltyBench, a benchmark specifically designed to evaluate the ability of language models to produce multiple distinct and high-quality outputs. NoveltyBench utilizes prompts curated to elicit diverse answers and filtered real-world user queries. Evaluating 20 leading language models, we find that current state-of-the-art systems generate significantly less diversity than human writers. Notably, larger models within a family often exhibit less diversity than their smaller counterparts, challenging the notion that capability on standard benchmarks translates directly to generative utility. While prompting strategies like in-context regeneration can elicit diversity, our findings highlight a fundamental lack of distributional diversity in current models, reducing their utility for users seeking varied responses and suggesting the need for new training and evaluation paradigms that prioritize diversity alongside quality.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2504.05228

Country:

Europe (1.00)
North America > United States > Pennsylvania (0.28)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Remember, but also, Forget: Bridging Myopic and Perfect Recall Fairness with Past-Discounting

Kumar, Ashwin, Yeoh, William

arXiv.org Artificial IntelligenceApr-1-2025

Dynamic resource allocation in multi-agent settings often requires balancing efficiency with fairness over time--a challenge inadequately addressed by conventional, myopic fairness measures. Motivated by behavioral insights that human judgments of fairness evolve with temporal distance, we introduce a novel framework for temporal fairness that incorporates past-discounting mechanisms. By applying a tunable discount factor to historical utilities, our approach interpolates between instantaneous and perfect-recall fairness, thereby capturing both immediate outcomes and long-term equity considerations. Beyond aligning more closely with human perceptions of fairness, this past-discounting method ensures that the augmented state space remains bounded, significantly improving computational tractability in sequential decision-making settings. We detail the formulation of discounted-recall fairness in both additive and averaged utility contexts, illustrate its benefits through practical examples, and discuss its implications for designing balanced, scalable resource allocation strategies.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

2504.01154

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > Missouri (0.05)

Genre: Research Report (0.40)

Industry: Transportation > Passenger (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.91)

Add feedback

AMUSE: Adaptive Model Updating using a Simulated Environment

Chislett, Louis, Vallejos, Catalina A., Cannings, Timothy I., Liley, James

arXiv.org Machine LearningDec-13-2024

Prediction models frequently face the challenge of concept drift, in which the underlying data distribution changes over time, weakening performance. Examples can include models which predict loan default, or those used in healthcare contexts. Typical management strategies involve regular model updates or updates triggered by concept drift detection. However, these simple policies do not necessarily balance the cost of model updating with improved classifier performance. We present AMUSE (Adaptive Model Updating using a Simulated Environment), a novel method leveraging reinforcement learning trained within a simulated data generating environment, to determine update timings for classifiers. The optimal updating policy depends on the current data generating process and ongoing drift process. Our key idea is that we can train an arbitrarily complex model updating policy by creating a training environment in which possible episodes of drift are simulated by a parametric model, which represents expectations of possible drift patterns. As a result, AMUSE proactively recommends updates based on estimated performance improvements, learning a policy that balances maintaining model performance with minimizing update costs. Empirical results confirm the effectiveness of AMUSE in simulated data.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

arXiv.org Machine Learning

2412.10119

Country:

Europe > United Kingdom (0.15)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report > New Finding (0.66)

Industry:

Health & Medicine > Therapeutic Area (0.68)
Health & Medicine > Health Care Providers & Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Dynamics of the Ride-Sourcing Market: A Coevolutionary Model of Competition between Two-Sided Mobility Platforms

Ghasemi, Farnoud, Drabicki, Arkadiusz, Kucharski, Rafał

arXiv.org Artificial IntelligenceOct-9-2023

There is a fierce competition between two-sided mobility platforms (e.g., Uber and Lyft) fueled by massive subsidies, yet the underlying dynamics and interactions between the competing plat-forms are largely unknown. These platforms rely on the cross-side network effects to grow, they need to attract agents from both sides to kick-off: travellers are needed for drivers and drivers are needed for travellers. We use our coevolutionary model featured by the S-shaped learning curves to simulate the day-to-day dynamics of the ride-sourcing market at the microscopic level. We run three scenarios to illustrate the possible equilibria in the market. Our results underline how the correlation inside the ride-sourcing nest of the agents choice set significantly affects the plat-forms' market shares. While late entry to the market decreases the chance of platform success and possibly results in "winner-takes-all", heavy subsidies can keep the new platform in competition giving rise to "market sharing" regime.

agent, market share, platform, (15 more...)

arXiv.org Artificial Intelligence

2310.05543

Country: